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ABSTRACT 

The first generation of large-scale chemical tagging surveys, in particular the HER- 
MES/GALAH million star survey, promises to vastly expand our understanding of the chem- 
ical and dynamical evolution of the Galaxy. This, however, is contingent on our ability to 
confidently perform chemical tagging on such a large data-set. Chemical homogeneity has 
been observed across a range of elements within several Galactic open clusters, yet the level 
to which this is the case globally, and particularly in comparison to the scatter across clus- 
ters themselves, is not well understood. The patterns of elements in coeval cluster members, 
occupying a complex chemical abundance space, are rooted in the evolution, ultimately the 
nature of the very late stages, of early generations of stars. The current astrophysical models 
of such stages are not yet sufficient to explain all observations, combining with our significant 
gaps in the understanding of star formation, makes this a difficult arena to tackle theoretically. 
Here, we describe a robust pair-wise metric used to gauge the chemical difference between 
two stellar components. This metric is then applied to a database of high-resolution literature 
abundance sources to derive a function describing the probability that two stars are of com- 
mon evolutionary origin. With this cluster probability function, it will be possible to report a 
confidence, grounded in empirical observational evidence, with which clusters are detected, 
independent of the group finding methods. This formulation is also used to probe the role of 
chemical dimensionality, and that of individual chemical species, on the ability of chemical 
tagging to differentiate coeval groups of stars. 

Key words: stars: abundances - Galaxy: open clusters and associations: general - techniques: 
miscellaneous (chemical tagging) - methods: statistical 



1 INTRODUCTION 

It has long been postulated that stars form in dense cl usters out 
of an overdensity of primordial gas and dust (e.g. see I Shu et al l 
Il987h . Evidence for this is seen in present day open clusters, but 
their numbers by no mea ns account f or the large stellar popula- 
tion of the Galactic disk. Ijanes et ^ l ll988l) suggest that a typi- 
cal lifetime of an open cluster in the disk is 100 Myr, after which 
their stellar components dissipate through dynamical interactions, 
explaining the fact that stars are not strictly observed in clus- 
ters. It is possible this dissipative action is seen in p rogress in so- 
called "moving groups" and young associations (e.g.lEggenlll994l ; 
lMontesetal]|200ll : IZuckerman & Song|l20o4 IXorres et alj|2008h . 
A small but significan t population of very old open cluste r s have 
since been identified jPhelps et all 11994 11 anes & Phelpsl [T994I : 
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|Friel','l995'), notably including NGC 6791 with an age of ~7.2 Gyr 
IfCaluzny & Rucins ki, 1995,), with the zone in-between filled by a 
wide range of ages l lXin & Dengll2005l) . Present day star formation 
within embedded molecular clouds, su ch as the Orion nebu la, com- 
pletes the circle of cluster evolution jLada & Ladal [2003h . These 
observations indicate that stars indeed have formed and continue to 
form in clusters through all epochs of Galactic evolution. 

Although the particulars of star formation in molecular clouds 
remain uncertain, the observed abundances of elements point to 
a successive chain of star formation, chemical enrichment of the 
interstellar medium, and subsequent formation events in a cloud, 
occurring over generations of stars. Most of the elements known 
to exist in the universe are pr ocessed in the very late stages or 
death throes of a star (e.g. see IWallerstein et al]ll997h . The ori- 
gin of the Fe-peak elements is believed to be primarily in type la 
supernovae, while the a-, and r-process elements are most likely 
predominantly generated within the violent explosions of core- 
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collapse (type II) supernovae jKratz et all |2007|) . The relatively 
more quiescent asymptotic giant branch (AGB) phase, in low- to 
intermediate- mass stars, is thought to be responsible for most of the 
s-processing l lKarakas & Lattanzioll2007l ; IBusso et all 1 199^ . and 
thus is the sole origin of the pure s-process elements and a contrib- 
utor of elements produced by both the s- and r-processes. The next 
generation of low mass stars, born out of this freshly enriched gas 
and dust cloud, will spend a significant amount of time on the main 
sequence and giant branches. In these stages, and at these masses, 
the nuclear processing and convection fueling the star are not suffi- 
cient to alter the photospheric abundances of those elements jlbenl 
[196% . Therefore, it is these groups of elements that form the basis 
of the technique that has come to be known as chemical tagging. 

The technique of chemical tagging, first proposed by 
lFreeman"& Bland- HawthornI l |2002t) . promises to identify clusters 
of stars, even after being scattered by the dissipative evolution 
of the disk, through patterns of abundances in these key ele- 
ments. To date, several important test have been made, confirm- 
ing that me mbers of ind ividual open clusters are chemically ho- 
mogeneous jPe Silva et al. . 2009, 2007a3, as well as less obvious 
kinematically l i nked stellar aggregates know n as moving groups 
jPe Silva et all l2007bl : iBubar & Kin"! |2010|) . The technique has 
not, to our knowledge, been applied on any scale to a random sam- 
ple of stars, but has been utilized to confirm or reject suspected 
member ship amongst kinematic groups in several recent stud- 
ies (e.g. Tabemero et alj 20121 : IPompeia et al.ll2oTll ; ICarretta et al.l 
l2012l : lDe' Silva et al Hoi l|). A particular challenge is the large num- 
ber of dimensions that a re required in order to guarantee un ique- 
ness across population s; iBland-Hawthom & Freemanl | |2004|) and 
iDe Silva et ^ ( l2007al) estimate between 10 and 15 elements, are 
necessary, given the number of formation sites and some assump- 
tions of abundance accur acy. These elements form the dimensions 
of the so-called C-space l lFreeman & Bland-Hawth"orrll2002h . The 
sample size of studies with sufficient abundances, however, remains 
small. 

The ambitious chemical tagging project known as the GALac- 
tic Archaeology with HERMES (GALAI^ survey, to be un- 
dertaken with the upcoming High E fficiency and Reso lution 
Multi-Element Spectrograph (HERMES iBarden et al.ll2010l) . will 
target a million disk stars at high-resolution in a relatively 
short time-span. The possibility of piecing back together some 
of th e presumably 10^ relic clusters dissolved in the Galaxy 
telan d-Hawthorn & Freemanl l2004h . could vastly expand our 
knowledge of the star formation history and structural and chemi- 
cal evolution o f the Galactic disk. T he Gaia-ESO Public Spectro- 
scopic Survey ( lGilmoreetan.l2012h . an on-going chemical abun- 
dance analysis survey, will target approximately 100000 field and 
open cluster stars in the Galaxy at high resolution, providing a rich 
dataset complementing GALAH. A very interesting prospect is the 
search for Solar siblings, or the birth cluste r of the Sun, and its ori- 
gin and possible journey within the Galaxy jBland-Hawthorn et all 
|201 Obi : [Batista & Femandesll2012h . The ability to tackle these are- 
nas is dependent on the ability to reliably distinguish compatible 
from incompatible chemistries (i.e. between two stars in such a sur- 
vey). 

We present here, for the first time, a metric that quantifies, in a 
consistent yet flexible manner, global abundance trends in terms of 
chemical tagging and show how a given population can be probed 
using this metric, along with a representative cluster sample, with- 
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out the need to define an arbitrary "level" of chemical homogeneity. 
We show how any stellar population can be chemically tagged, and 
how a quantitative level of confidence can be applied to its mem- 
bers to gauge the overall quality of a detection based solely on el- 
emental abundances. We further discuss some implications of this 
approach to carrying out large scale chemical tagging surveys such 
as GALAH. 



2 A METRIC TO PROBE THE CHEMICAL NATURE OF 
CLUSTERS 

On the subject of chemical tagging, there has been little discussion 
in the literature thus far on establishing quantitative differences 
in chemical signatures between different clusters and the stars 
within a given cluster. Several authors have obtained and analyzed 
high resolution data-sets on known cluster or mov ing group pop- 
ulation s with the aim of testing che mical tagging jPe Silva et all 
l2007a| [bl [2009l : lBubar & Kinall201(]|) with quite promising results. 
These studies have taken a qualitative approach to differences in 
chemical abundance patters. There has also been work recently 
on understanding the complexity and interrelation of t he chemical 
abundance space using Principle Component Analysis jTing et ^ 
l2012h . The goal there is to reduce the overall dimensionality to 
only the strongest variant dimensions. That study, however, was 
restricted to probing the chemical patterns between different pop- 
ulations, including open clusters, looking for the largest scatter in 
nucleosynthesis yields. There is still lacking a global characteri- 
zation of the differences in patterns in C-space, e.g. the level of 
homogeneity expected within a cluster is not clearly defined. 

The difficulty in conceptualizing the interplay between all of 
the dimensions of C-space, combined with the non-uniformity of 
the available abundance data, leads us to seek a description of the 
chemical difference between clusters that is both independent of 
the dimensionality of C-space, and itself one-dimensional. 



2.1 A metric for determining chemical difference 

We define a simple metric for quantifying the chemical difference 
between any two stars 



Nc 



LJC- 



Nc 



Where C is a particular chemical species out of the Nc species for 
which abundances have been determined, Ac is the abundance ra- 
tio of element C to Fe relative to solar (i.e. [X/Fe] in standard nota- 
tion), except when C is Fe, in which case it is the ratio of Fe to H; j 
and j are the two stars for which this metric is being computed; and 
ujc represents a weighting factor for an individual species. For this 
work, ujc is always equal to one, but is included in this definition 
for future investigations, when sufficiently large data-sets, which 
will allow its examination, become available. Simply put, 5c is the 
mean absolute difference of species C between two stars across all 
Nc chemical species. There are two reasons for the choice of this 
type of metric (a Manhattan type) as opposed to a Euclidean type. 
First and foremost, its use facilitates the separation of clusters in 
C-space, the ultimate goal of chemical tagging, and secondly, the 
value of 5c is more easily interpreted in the context of abundance 
measurements. Another advantageous property of this metric def- 
inition, due to the Nc denominator, is that when considering any 
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number of correlated elements in the summation, the value of 5c is 
independent of the number of number of elements, Nc- 

In this paper, the binned probability density distribution of 
Sc for a single population, normalized to its maximum per-bin Sc 
number density, shall be referred to as the rye distribution. When 
such a population contains only members of a cluster, i.e. a single 
group of coeval stars, chemical tagging theory posits that this distri- 
bution should occupy the low end of the difference spectrum, while 
populations derived from strictly distinct clusters will occupy the 
high end. The following two terms describe these different popu- 
lations and will be used throughout this text. Their definitions and 
uses should be understood as follows: 

• intra-cluster: All members of this group are confirmed mem- 
bers of a single open cluster, r/c will be built from those Sc com- 
puted between any two stars that satisfy the above definition and 
for which any stated conditions apply (e.g. minimum number of 
elements, etc.) 

• inter-cluster: Members of this group are confirmed members 
of any open cluster, rjc will be built from all those 5c computed 
between two stars that are not members of the same open cluster 
and for which any stated conditions apply. 

Figure [T] shows both intra- and inter-cluster rjc distributions 
for the clusters listed in Table [T] calculated with the condition that 
a minimum of 8 elements be used in the 5c computation (for a de- 
tailed explanation of this choice, see section[5]l. We note that the to- 
tal range of the inter-cluster distribution is much larger than that de- 
picted, however it is not instructive to display anything beyond the 
range of the intra-cluster distribution for the purposes of our anal- 
ysis. It is important also to note that, because we do not assume a 
cluster is represented by its mean abundances, the inter-cluster dis- 
tribution takes into account all scatter within a given cluster, across 
all chemical dimensions. This point is crucial to our analysis, and 
its potential usefulness in terms of a true chemical tagging experi- 
ment, in that we wish to determine the ability to distinguish individ- 
ual pairs of stars. Assuming perfect homogeneity amongst cluster 
members in this respect would likewise circumvent our treatment 
of the intra-cluster distribution. 

2.2 Open cluster abundance data 

Table [T] gives a summary of the literature elemental abundance 
database generated for use in this analysis. Each row in the table 
is a particular study, from the literature, of a single cluster as in- 
dicated. Basic information includes the cluster name, the number 
of stars for which abundances were derived (A'^*), the total number 
of element species derived (Nc), and the literature reference. The 
remaining columns in the table summarize the abundance data: for 
each element existing in the database, the number of stars for which 
abundances for that element were derived is listed. The numbers in 
this matrix are equal to or less than the corresponding A'^*, depend- 
ing on, for example, the range or quality of a particular spectrum. 

The table was generated first using sources listed in Table 12 
of|c arrera & Pancind ( 1201 ih . and then by a literature search for 
any other studies based on high-resolution data for open clusters 
with abundances of those elements listed in Table [T] Due to the 
sensitivity of computed abundances, especially when derived from 
equivalent widths (EWs), with respect to resolution, and in order 
to best match upcoming chemical tagging surveys, we only consid- 
ered studies whose typical resolution was better than R~28,000. 
We also only considered individual clusters which had at least four 
stars for which abundances were derived. This latter condition is 



o Ln 



o 



6j, (dex) 

Figure 1. The rjc distributions for intra- (red; left) and inter-cluster (blue; 
right) populations. The total range of the inter-cluster distribution is larger 
than shown, and has been cut off as described in the text. 



Table 2. Sources and designations used in Table[T] 



Source reference 


Label 


Soderblom et al. (2009) 


A 


D'Orazi & Randich t2009) 


B 


Ford et al. (2005) 


C 


Sestito et al. (2008) 


D 


Shen et al. (2005) 


E 


Braaaglia et al. (2008) 


F 


Pace et al. (2008) 


G 


Maarini et al. (2010) 


H 


De Silva et al. (2007a) 


I 


Randich et al. (2006) 


J 


Carretta et al. (2007) 


K 


Gonzalez & Wallerstein (2000) 


L 


Smilianic et al. (2009) 


M 


De Silva et al. (2006); Paulson et al. (2003) 


N 


Carretta et al. (2005) 


O 


^reira & Ouireza (201ff) 


P 


Mitschang et al. (2012) 


Q 


^ebran et al. (2008) 


R 


G^ebran & Monier (2008_) 


S 


Tautvaisiene et al. (2000) 


T 


Reddv et al. (2012) 


U 



an optimisation between the total number of clusters included and 
the number of pairs used for the intra-cluster distribution, ensuring 
adequate statistical significance for both populations. There proved 
to be no reason to limit the number of chemical species derived in a 
particular study (Nc)', indeed the range allows us to probe some of 
the effects of chemical dimensionality on the technique of chemical 
tagging, which will be discussed in more detail in Section|5] 

All abundance values collected in the database are with re- 
spect to Fe and compared to solar, i.e. [X/Fe] in standard nota- 
tion. Fe abundances are the only exception and are tabulated as 
[Fe/H]. Where a literature study reported abundances otherwise, 
those values were converted to this system when possible, and oth- 
erwise omitted. Care was taken to ensure that the studies targeted 
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Table 1. Summary of the high-resolution literature abundance database 



Cluster Source" Af* Nc Na Mg Al Si Ca Sc Ti V Cr Mn Fe Co Ni Cu Zn Sr Y Zr Ba La Ce Nd Sm Eu 



® 



Pleiades 


A 


20 


5 


20 






20 


- 


- 


20 


- 


- 


- 


20 


- 


20 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


1C2391 


B 


7 


6 


7 


■ 


- 


7 


7 


- 


7 


- 


- 


- 


7 


- 


7 


- 


- 


- 


- 


- 


- 


- 


■ 


- 


- 


1C2602 


B 


8 


6 


8 


- 


- 


8 


8 




8 








8 




8 










- 












Blanco 1 


C 


8 


6 


- 


8 


- 


8 


8 


- 


8 


- 


- 


- 


8 


- 


8 


- 








- 












Be29 


D 


6 


6 


- 


- 


- 


6 


6 


- 


6 


- 


6 


- 


- 


- 


6 


- 


- 


- 


- 


6 


- 


- 


- 


- 


- 


Cr261 


D 


7 


6 




- 


- 


7 


7 




7 




7 








7 










7 












Mel66 


D 


6 


6 


- 


- 


- 


6 


6 


- 


6 


- 


6 


- 


- 


- 


6 


- 


- 


- 


- 


6 


- 


- 


■ 


- 


- 


1C4665 


E 


18 


6 




18 




18 


18 




18 




18 








18 










- 












Be32 


F 


9 


8 


9 


- 


- 


9 


9 


- 


9 


- 


9 


- 


9 


- 


9 


- 


- 


- 


- 


9 


- 


- 


- 


- 


- 


NGC2324 


F 


7 


8 


7 


- 


- 


7 


7 




7 




7 




7 




7 










7 












NGC2477 


F 


6 


8 


6 


- 


- 


6 


6 


- 


6 


- 


6 


- 


6 


- 


6 


- 


- 


- 


- 


6 


- 


- 


■ 


- 


- 


NGC2660 


F 


5 


8 


5 


- 


- 


5 


5 




5 




5 




5 




5 










5 












NGC3960 


F 


6 


8 


6 


- 


- 


6 


6 


- 


6 


- 


6 


- 


6 


- 


6 


- 


- 


- 


- 


6 


- 


- 


- 


- 


- 


1C465 1 


G 


5 


8 


5 


- 


5 


5 


5 


- 


5 


- 


5 


- 


5 


- 


5 


- 


- 


- 


- 


■ 


- 


- 


■ 


- 


- 


M67 


G 


6 


8 


6 


- 


4 


6 


6 


- 


6 


- 


6 


- 


6 


- 


6 


- 


- 


- 


- 


- 


- 


- 


■ 


- 


- 


Praesepe 


G 


7 


8 


7 


- 


5 


7 


7 




7 




7 




7 




7 










- 












NGC6192 


H 


4 


8 


- 


4 


4 


4 


4 


- 


4 


- 


4 


- 


4 


- 


4 


- 


- 


- 


- 


- 


- 


- 


- 


- 


- 


NGC6404 


H 


4 


8 




4 


4 


4 


4 




4 




4 




4 




4 










- 












Cr261 


I 


12 


9 


12 


12 


■ 


12 


12 


- 


- 


- 


- 


12 


12 


- 


12 


- 


- 


- 


12 


12 


- 


- 


■ 


- 


- 


M67 


J 


10 


9 


10 


10 


10 


10 


10 




10 




10 




10 




10 










" 












NGC3532 


M 


5 


9 




5 




5 


5 


5 


5 


5 


5 






5 


5 






















NGC4756 


M 


5 


9 


- 


5 


- 


5 


5 


5 


5 


5 


5 


- 


- 


5 


5 






















NGC5822 


M 


5 


9 




3 




5 


5 


5 


5 


5 


5 






5 


5 






















NGC6791 


K 


4 


9 




4 


4 


4 


4 


4 


4 






4 






4 










4 












Mil 


L 


4 


10 


4 




4 


4 


4 


4 


4 










3 


4 












4 








4 


NGC6253 


K 


4 


10 


4 


4 


4 


4 


4 


4 








4 






4 










4 










4 


Hyades 


N 


46 


12 


46 


46 




46 


46 




46 








46 








46 - 




46 


46 


18 


46 


46 






Cr261 


O 


6 


13 


6 


6 


6 


6 


6 


6 


6 




6 


6 


6 


6 


6 










6 












NGC3114 


P 


7 


13 




5 


6 


7 


7 


7 


7 




7 








7 






5 


6 




7 


6 


6 






NGC3680 


Q 


11 


14 


11 


11 


8 


11 


11 




11 




11 




11 




11 




11 


11 




8 


7 




8 






Berenice 


R 


11 


15 


10 


11 




6 




6 


11 


11 


10 


10 


11 


11 


11 




6 


6 


11 


11 












Pleiades 


S 


5 


16 


5 


5 




5 


5 


5 


5 


5 


5 


5 


5 


5 


5 




5 


5 


5 


5 












M67 


T 


9 


20 


9 


9 


9 


9 


9 


9 


9 


9 


9 


9 


9 


9 


9 


9 




9 


9 


9 


9 


9 






9 


NGC2360 


U 


4 


21 


4 


4 


4 


4 


4 


4 


4 


4 


4 


3 


4 


4 


4 


4 


2 


4 


4 


3 


3 


2 




3 




NGC752 


U 


4 


23 


4 


4 


4 


4 


4 


4 


4 


4 


4 


4 


4 


4 


4 


4 


3 


4 


4 


4 


4 


4 


4 


3 


4 



> 

c/5 ° References using designations listed in Table|2] 
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Table 3. Literature abundance database columns format t 



Cluster 


CID 


Star 


SID 


Src° 


El 


[X/Fe]* 


EiTor 


NGC2360 


50 


12 


1046 


U 


Al 


-0.01 


0.05 


NGC2360 


50 


12 


1046 


U 


Ca 


0.13 


0.06 


NGC2360 


50 


12 


1046 


U 


Co 


0.03 


0.09 


NGC2360 


50 


12 


1046 


U 


Cr 


0.02 


0.05 


NGC2360 


50 


12 


1046 


U 


Cu 


-0.13 





intra-cluster distribution may also be attributed to noise, perhaps 
in the form of mis-identified group members. Another possible ex- 
planation for these "outliers" is related to errors in their abundance 
measurements. If not a noise feature, it may represent some form of 
cloud fragmentation in the progenitor of those clusters, giving rise 
to multi-modal star formation efficiencies within the same molec- 
ular cloud. In any case, as we shall demonstrate, neither of these 
features affects the probability analysis in the rest of this paper to 
any significant degree. 



t The complete table with a detailed description of its contents is available 
in the electronic version of this text. 
" As in TableU] 
Except for Fe -i> [Fe/H] 

unevolved stars. Uncertainties, where available, are included in the 
database and are used to inform typical values as discussed later in 
the text. 

The final database with individual abundances, which is avail- 
able as a machine readable table with the online version of this text 
and on the we contains 35 individual cluster studies of a total of 
30 distinct clusters (clusters from separate studies are listed sepa- 
rately in Table[TJ from 22 literature sources (those listed in Table[2l(. 
There are a total of 2775 individual abundance values measured for 
291 stars, with a minimum and maximum chemical dimensionality 
of 5 and 23, respectively. A stub of the database is given in Table 
[3] where CID is a unique identifier for a given cluster in a given 
study, and SID is a unique identifier for a given star. These iden- 
tifiers make it simple to disambiguate stars with repeated names 
and clusters from different studies. The complete version of Table 
[3] is available both online and in the electronic version of this pa- 
per. The tabulated abundance values from the literature were parsed 
via a computer program insofar as possible, and by hand in some 
cases where the formats were not as uniform. In very rare cases a 
literature study that matched our criteria may have proved to be too 
difficult to parse and load (e.g. a scanned hard copy) and had to be 
omitted. 

2.3 The intra- and inter-cluster i]c distributions 

Upon inspection of Figure [T] it is evident that the two populations 
have peaks significantly separated in Sc space. Each peak can be 
thought of as the "characteristic" chemical difference of that pop- 
ulation. In other words, based on Figure [T] we find that Galactic 
open clusters, as represented by our literature abundance database, 
typically differ chemically by ~0. 120 dex, while open clusters' in- 
ternal scatter is typically at the ^^0.045 dex level. Next we draw 
the readers attention to the crossing point of the two distributions 
at ~0.07 dex. This represents the point at which a pair of stars 
would be equally likely to inhabit the same cluster as they would to 
be members of entirely different clusters. Pairs of stars whose Sc 
computes to the right of this line are then more likely than not to be 
of distinct origin, while those whose 5c computes to the left would 
be considered more likely to be siblings. 

Another obvious feature is the bump in inter-cluster 77c close 
to 0.19 dex. This is most likely due to noise on account of the lim- 
ited numbers of clusters in this work and would smoothe out if 
more data were available. The extended tail toward high Sc in the 

^ Note: To be uploaded to Vizier 
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3 SYSTEMATICS 

Systematic uncertainties folded into abundance determinations 
come in many forms. Among these are the methods used in com- 
puting EWs (and those used to determine continuum levels); the 
choice of atmospheric models including assumption of Local Ther- 
modynamic Equilibrium (UrE)/non-LTE calculations and convec- 
tive overshooting; the synthetic spectrum codes used; and the solar 
abundance values used to compute relative abundances, to name 
just a few. In general, an individual study will use identical tech- 
niques and parameters relating to these factors in order to maintain 
internal consistency. As such, we do not believe that systematic ef- 
fects significantly alter the intra-cluster distribution computed from 
our database from the "natural" one. 

The story for mfer-cluster comparison is much more com- 
plicated. Experience has shown that even with high quality high- 
resolution data, the latest models and codes and very careful analy- 
sis, slight differences in approach between studies can yield sig- 
nificantly different results. Assuming an "intrinsic" inter-cluster 
rjc with some variance and mean Sc, introducing systematics will 
cause a broadening and a shift in the mean. Searching for an es- 
timate of the intrinsic distribution, such that adding a systematic 
treatment will result in an adequate representation of the data, is un- 
realistic given the complexity of inter-cluster pairs in the database. 
Systematics, for these purposes, should only affect pairs made from 
different literature sources. The differing number of clusters and 
stars per literature source also makes any simplified treatment im- 
possible. To gauge the magnitude of both systematic effects, the 
mean shift and broadening, for the specific case of our database, 
we induce uncertainties in the data and extrapolate backward to a 
best guess of the intrinsic distribution. 

To accomplish this a Monte-Carlo type of approach was taken, 
where in each simulation iteration, we added randomly derived un- 
certainties to the abundances of similar elements for all unique 
sources. In other words, the same random value was added to all 
the measurements of a given chemical species in a given litera- 
ture study, regardless of cluster membership. The absolute values 
of our randomly derived uncertainties were taken from a Gaussian 
distribution whose parameters fit the distribution of uncertainties 
recorded in the database. Figure |2] shows both the literature un- 
certainties and our Gaussian model, which provides a reasonable 
fit to the data. The inter-cluster rjc is then computed from this set 
of simulated cluster abundances and can be compared to the true 
inter-cluster rjc from our database. 

We computed 1000 iterations of the systematics simulation de- 
scribed above and derived rjc across all simulations. The resultant 
distribution is shown in the left panel of Figure[3](grey), along with 
the data (blue), which have been shifted to align the peaks. The 
broadening due to simulated abundance uncertainties is evident. To 
reverse this process, or de-broaden, the difference between simula- 
tion and data is removed from the data to arrive at the green his- 
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Figure 2. Gaussian fit (red) to the normalized distribution of uncertainties 
reported in tlie literature (black histogram). 

togram in Figure [3] left panel. The grey cross-hatching illustrates 
the total change between simulations, data and the de-broadened 
distribution. Addition of the simulated uncertainties caused the 
mean 5c in the resultant distribution to be shifted to a 5c 56% 
larger than that of the original distribution. When corrected for the 
de-broadened distribution, in order to arrive at an estimate of the 
intrinsic distribution, we find the 5c for which a 56% increase ar- 
rives at the data mean, this results in a peak at 5c of ~0.077 dex 
shown in the center panel of Figure[3] which is a repeat of Figure[T] 
using the "intrinsic" inter-cluster rjc instead of the data. 

This is perhaps a crude approximation, but should nonetheless 
give an idea of the magnitude of potential effects. The uncertain- 
ties used, for example, are internal random errors computed in a 
variety of ways and may not represent the sources of error between 
studies as well. Specific considerations, however, are beyond the 
scope of this work. The bulk shift of our de-broadened distribution 
likely mis-represents the true rising edge. Specifically, the first two, 
perhaps three, bins shown in the center panel of Figure[3]are proba- 
bly artefacts of this. Another artefact is visible in the simulated ric 
in the left panel of Figure [3] at a 5c of ^^0.05 dex. The origin of 
this feature is unclear, but, given its location, it does not affect our 
analysis in any meaningful way. 

Internal, random, errors in abundance measurements will also 
alter the rjc distributions from their intrinsic profiles. Depending 
on the overlap between clusters in C-space, the magnitude of this 
effect will be different for either population; however, the scatter 
induced in the intra-cluster population requires that its magnitude 
be greater. This implies that the overlap between rjc distributions 
seen in Figure [3] is an absolute upper limit, and the peaks would 
be separated by a greater distance in 5c, favoring the distinction of 
clusters. The specific impact that systematic effects, as discussed 
here, have on our conclusions in this work will be discussed in fur- 
ther detail in the next section. 



4 THE CLUSTER PROBABILITY FUNCTION 

A major goal of this work is to derive a quantitative assessment of 
the ability to distinguish coeval stars from those born in distinct en- 
vironments, based solely on their chemical signatures. The level of 
homogeneity that is expected within a Galactic open cluster, and the 
sparseness of the chemical space between them, are readily inferred 
from Figure[T] We now combine these two concepts to achieve this 
goal. The probability that a particular pair of stars are members of 
the same cluster based on their 5c is related to the relative contri- 
butions of each ric distribution, and can be expressed as 



1 1 1 1 
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Figure 4. The intra-cluster probability function. The black squares and his- 
togram show the discrete probability function for the same r]c distributions 
shown in Figure[T] The solid red line shows the least squares fit to this using 
the continuous function with parameters listed in TableO where the dashed 
cross-hairs represent the 68% confidence hmit that a pair of stars are co- 
eval. Shown in green is the continuous fit for the "intrinsic" inter-cluster 
rjc derived from our systematics considerations. 
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When computed for the specific rjc distributions plotted in Figure 
[T] we obtain the discrete probability function shown in Figure |4] 
where the cross-hairs represent the 68% confidence limit. Note the 
steepness of the function; by the time it reaches a 5c of 0. 1 dex, 
there is only approximately a 10-20% probability that those two 
stars are members of the same cluster. This places tight constraints 
on the performance of a chemical tagging survey (e.g. see Section 
[6j. 

4.1 A functional form of P^^ 

We now seek a functional form to express the intra-cluster prob- 
ability continuously as a function of 5c- A modified form of the 
sigmoid function with four free parameters describes the data well 
and can be written as 

1 - c 



prob(5c) = c + ^ , , , 

The c term accounts for the lower probability limit caused by the 
tail in the intra-cluster distribution, while the 1 — c term ensures 
that we do not obtain a probability greater than 1 . The solid red line 
in Figure |4] shows the least squares best fit to the discrete probabil- 
ity function. The goodness of fit illustrates both the smooth nature 
of the rjc distributions, and the appropriateness of the continuous 
probability function. 

Also shown in Figure |4] is a fit to the probability function 
from our derived "intrinsic" inter-cluster rjc (see Section|3j. This fit 
(shown in the right panel of Figure [3] along with its discrete coun- 
terpart) assumed that the first bin, corresponding to a 5c of zero, or 
identical chemistries, actually approaches one. Likewise, the sec- 
ond bin was ignored, both of these under the assumption that they 
are noise features arising from our crude systematics treatment. 

The magnitude of possible systematic effects is quite visible. 
At the 68% confidence level the difference between the two func- 
tions is ~0.015 dex. It is difficult to judge whether our "intrinsic" 
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Figure 3. Summary of simulated inter-cluster systematics treatment. The left panel shows the rjc from 1000 systematics simulations (grey line), along with the 
peak-shifted data (blue) and the de-broadened and normalized "intrinsic" distribution (green). The center panel shows the "intrinsic" distribution peak-shifted 
to smaller Sc based on the shift from our simulations. The rightmost panel shows the probability function (see Section|4) derived from the "intrinsic" r]c, and 
fit assuming the first two bins are an artefact of this crude treatment. 



Table 4. Best fit parameters for Pg^ 



Parameter Data Systematics simulation 

a 13.37 9.164 

b 65.51 64.41 

c 0.041 0.148 

d 7.060 5.242 

function actually represents such, but Figure |4] does help to illus- 
trate the advantages of homogeneous abundance analysis, even if 
only to avoid the need for such considerations as applied here. The 
four parameters corresponding to each fit are listed in Table|4l 

This formulation now provides us with a way of not only fil- 
tering the results of cluster finding algorithms, but importantly, a 
means to report the confidence with which we detect them. There 
will be a rate of attrition due to the overlap in the respective distri- 
butions, an effect which is likely due to natural scatter in addition 
to complex and not well understood processes occurring in the pro- 
genitors of all open clusters. We stress the importance of arriving 
at this probability function through empirical analysis of observa- 
tional data. Until we can be confident that we fully understand all 
of the intricacies of star formation and Galactic chemical evolution, 
this empirical approach will remain the only way to achieve this. 

This is not to say that the parameters listed in Table |4] are 
not subject to change. These most likely represent a lower limit 
to the ability to chemically tag, due to the heterogeneous analysis 
methods present in our abundance database. In the case of a large 
scale chemical abundance survey such as GALAH, or Gaia-ESO, a 
probability function specific to the analysis methods of that survey 
would be derived - so long as an appropriate calibration program 
targeting a number of established coeval populations were adopted. 
The analysis described in the present paper would be repeated using 
cluster abundance data from that calibration program. The Gaia- 
ESO survey serendipitously has this calibration built into its de- 
sign, since it aims to target 100 Galactic open clusters. Possibly, 
though not ideally, this could serve as calibration for GALAH. For 
heterogeneous chemical tagging experiments (without the benefits 
of a uniform abundance analysis), the most up-to-date abundance 
data would be added to the literature abundance database and a new 



probability function derived. In such a way, the function parameters 
could be iteratively improved. In the meantime, the function with 
parameters listed in Table|4]represent our current best guess, within 
the limitations of the abundance data in our database. 



5 EXPLORATIONS IN C-SPACE 

Up to this point, all our discussion has been involving the 77c dis- 
tributions whose 5c metric computations were required to have at 
least a dimensionality of eight (i.e., a minimum of eight distinct el- 
ements being compared). This is not an entirely arbitrary decision, 
but one that is based both on the practicality of future chemical tag- 
ging experiments, and a delicate balance between optimising the 
cluster probability function and maintaining the statistical signifi- 
canc e of the distributions themselves. 

iBland-Hawthorn & Freeman! | |2004|) estimated the minimum 
number of dimensions required to reveal a fraction of the approxi- 
mately 10** clusters believed to be dissolved in the Galaxy. This fig- 
ure, between 10-15 elements, has been incorporated into the design 
of the HERMES in strument for the p urpose of the GALAH chem- 
ical tagging survey. iTing et al] ( l2012h confirm this number finding 
approximately 9 independently varying elements from the available 
abundance data. The constraint employed here matches well these 
capabilities, in fact, the mean number of elements in the intra- and 
inter-cluster rjc distributions came out to be 11 and 9, respectively. 
In other words, out of the total number of 5c pairs in each distri- 
bution, half were computed with greater than, or equal to, 1 1 and 9 
compatible element abundances. It is important, also, to understand 
the species that dominate each of the distributions. These will be 
the minimum set of species required to utilize the cluster probabil- 
ity function with the parameters listed in Table |4] In Figure [5] we 
plot the number of 5c computations, per element, for both inter- 
and intra-cluster rjc ■ The 9 and 1 1 key elements for those two can 
clearly be seen in the upper portion of Figure[5] these are dominated 
by the a and Fe-peak elements, in addition to Ba, due to their rel- 
ative ease of measurement. We stress that these do not necessarily 
represent the best chemical tagging discriminators, but rather that 
the high ranking elements shown are capable of chemical tagging 
at the level depicted in Figure|4l based on the data available. 
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Figure 5. The numbers of 5c computations making up eacli distribution 
shown in Figure [T] The "ranking" increases from top to bottom, where a 
lower number rank indicates greater significance in the overall distribution. 
This is not a reflection of the significance of any individual element as a 
chemical tagging probe, but rather informs the groups of elements generally 
necessary to utiHse the cluster probability function described here. 
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Figure 6. The effects of dimensionality on the probability function. The 
top panel shows discrete and fit continuous probability functions for both 
the minimum 8 dimensions (upper;red) and no limit (lower;blue dotted) rjc 
distributions. The bottom panel shows residuals computed from P^^ ~Ps^ 
for data (black histogram) and continuous representations (red line). The 
strongly positive residuals toward the intra-cluster portion (left-hand side) 
of the plot clearly speak to the necessity of a larger number of dimensions. 
Cross-hairs are as in Figure|4] 



5.1 Probing chemical dimensionality 

The role of dimensionality on chemical tagging can be tested using 
the formulation presented here. By relaxing the requirement that 
a minimum number of species be present in each 5c computation 
we can include the entire database summarized in Table [T] which 
results in a minimum of 2 and mean of 5.6 dimensions. Figure |6] 
shows the probability functions derived from their rjc distributions, 
including continuous fits, and the residuals (both discrete and con- 
tinuous) obtained by the difference Pf^ — Pg^ . It is evident from 
the excess in residuals at low Sc that the reduction in dimensions 
results in a significant loss in the ability to perform chemical tag- 
ging. Our ability to explore the same effects for higher dimensions 
is trumped by the limited sample of studies with a wide range of 
element abundances. The sharp edge seen in both panels of Figure 
[5](at rank 8 and 10 in left and right panels, respectively) illustrates 
this. The numbers of pairs dramatically reduce and noise immedi- 
ately dominates our distributions. For example, with a minimum of 
10 dimensions imposed, the number of inter-cluster pairs reduces 
by 80% and the number of clusters being probed cuts almost in 
half, from 27 to 14. 

Although it is fairly intuitive at this point to conclude that in- 
creases in chemical dimensionality lead to improvements in the 
probability function, it must be noted that improvements, if any, 
would not necessarily be linear or predictable. The difficulty in de- 
riving abundances for some elements (e.g., weak lines, few lines 
for EWs, etc.) at a given resolution could yield greater uncertain- 
ties and potential scatter, resulting in little improvement. Indeed, 
the limitations on probing higher dimensions in our literature abun- 
dance database likely are a result of this. Furthermore, elements 
whose abundance trends are coupled (e.g., see next section) would 
not singularly lend a comparable improvement over individual un- 
coupled elements. 



5.2 Probing specific elements 

It is now clear that a number of chemical dimensions is required 
to differentiate clusters in the Galaxy. Each dimension, however, 
defined by the efficiency of its nucleosynthesis in the previous gen- 
eration of stars, need not contribute evenly to the overall distribu- 
tion. Elements whose processes are relatively insensitive to stellar 
mass, for instance, will be generated in most cases and the amount 
may only depend on a few parameters of the progenitor cloud. We 
explore the role of individual species in the probability function by 
recomputing the Sc metric for each pair, ignoring that species. The 
probability function computed from the resulting distributions is 
then compared to that from the originals. Figure |7] shows the resid- 
uals obtained by subtracting the new function from that depicted in 
Figure|4](red line) and Table|4l for a range of elements with a high 
ranking (as from Figure|5](. Each panel also plots a linear regression 
to the residuals in the range from to 0.25 (beyond this range the 
functions become extremely noisy and mostly meaningless), in or- 
der to illustrate the general trends. In these plots, a positive residual 
can be interpreted to imply that the element in question contributes 
above average to cluster differentiation, i.e., abundance homogene- 
ity is preserved within a cluster while the abundance dispersion 
between clusters is increased for that element. This is particularly 
true at the low end of the Sc spectrum (i.e. intra-cluster probability 
greater than approximately the 68% level) where chemical tagging 
is possible, marked by the vertical dashed line in Figure|7] 

Typically, for the a and Fe peak elements probed, the level of 
influence is average to within the noise, at less than about 1-2%. 
This is not to say that they are poor chemical tagging elements, 
only that they act in concert, providing the robustness and noise re- 
duction of greater numbers of dimensions. The a panel of Figure|7] 
exemplifies this point, where here the combination of Na, Mg, Al, 
Si and Ca acts as a proxy for the a elements. Although the resultant 



© 0000 RAS, MNRAS 000, 000-000 



Quantifying Chemical Tagging 9 



<] 



<] 



O - 



o - 



o - 



o 



o 



o 




6^ (de> 



6^ (dex) 



6^ (dex^ 



6^ (dex^ 



Figure 7. Examining tlie role of chemical tagging species in the cluster probability function. Each panel shows the difference residuals between the function 
derived in the text and one derived with the particular element, as indicated, ignored in the Sq metric computation. For most of the elements listed, the effect 
is quite small and generally hnear. The negative slope, and positive residuals residuals in the low end of the plot without Ba, indicates its power as a chemical 
tagging discriminator. 



rjc must suffer from an increase in noise due to diminishing num- 
bers of dimensions (the total number of pairs is still preserved), it 
is quite clear that the effect on the cluster probability is significant, 
at greater than the 10% level throughout intra-cluster regime. 



Fe and Si seem to be exceptions, contributing closer to the 
3-4% level on their own. The heavy n-capture element Ba single- 
handedly alters the cluster probability by close to 8%. Ba can be 
produced both throug h the slow and rap id n-capture processes, in 
low mass AGB (e.griBusso etal.ll200ll) and core-collapse super- 
novae (e.g. lXratz et al. 2007 ). respectively, giving a wide range of 
conditions owing to the observed scat t er in disk field and betwee n 
Galactic open clusters jP'Orazi et all l2009l: [Sensbv et all l2005h . 
The dependence on age found bv lD'Orazi et al.l l l2009l) may also 
be related to its power as a cluster discriminator. Different clusters 
evolving on different time-spans will exhibit distinct efficiencies 
in Ba production, which are then imprinted on the new forming 
cluster stars. These possibilities imply that other elements, such as 
La, Nd and Eu, that form through similar processes, may lend to 
a significant improvement to the Pg^, ; however, due to the limited 
availability of their abundances in the literature our sample does 
not contain sufficient numbers to explore these effects further. We 
must not discount that non-LTE effects, which may lead to sig- 
nificant overestimate of Ba abundances in young stars, could be 
responsible for some of its perceived power. 



There are many factors conspiring to make meaningful com- 
parisons for other elements difficult, not least of which is the stark 
drop off in numbers of pairs for elements beyond those in Figure 
|7] Provided that there are proper calibrations against known Galac- 
tic open clusters, upcoming large scale abundance surveys, such as 
GAL AH, will be highly optimised for these types of investigations. 



6 CLUSTER RECONSTRUCTION 
6.1 Mock chemical tagging experiment 

We have already touched on the fact that the overlap between the 
intra- and inter-cluster rjc implies that clustering detection effi- 
ciency can not be 100%. In order to maximise the confidence with 
which we report detections, we must sacrifice potential members 
in lieu of including non-members. The actual efficiency which can 
be achieved is a function of the area of the overlapping regions 
between intra- and inter-cluster rjc (see Figure [TJ, and the confi- 
dence limit, Piim, set on detections (e.g. 68, 90, 98%). In order to 
establish how said overlap affects the recovery of clusters, and to 
demonstrate a practical application of the probability function to 
a group of cluster candidates, we have performed a blind chemical 
tagging experiment using the abundance database discussed herein. 
A brief description of the method used follows. 

Sc was computed for each distinct pair of stars, ignoring our 
a-priori knowledge of cluster membership, thus emulating a true 
chemical tagging survey. We then identified all pairs with a proba- 
bility Ps^ greater than Pum, where Pum was evaluated at 68, 85, 
and 90%. Starting with the most significant pair of this subset, stars 
were added to groups where all pairs in the resulting group have at 
most a 5c corresponding to Pum- After addition to a group, a star 
was removed from the subset of significant pairs. When no more 
stars could be added to the current group, the next most signifi- 
cant pair was identified and this process repeated. Grouping was 
then completed when all pairs of the subset were exhausted. In this 
method all pairs of members of a candidate group have at most 
5c corresponding to Pum, as opposed to the group mean. This re- 
striction ensures we identify the core of a cluster and limits the 
runaway in linkages that would otherwise occur. This is not partic- 
ularly sophisticated, and there may be more optimal methods such 
as evolving the cluster mean as members get added, but it does 
ensure a strict and invariant interpretation of Pum for our simple 
experiment. 

Table [5] summarizes the results of our experiment for Pum of 
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Table 5. Summary of chemical tagging test experiments 



p 

^ l im 


N 


Detection Rate 


Contamination Rate 


Fragmentation Rate 


Mean A^detect 


Mean efficiency 


68 


16 


59% 


50% 


25% 


7.50 


42% 


85 


20 


74% 


50% 


50% 


4.85 


25% 


90 


9 


33% 


22% 


57% 


4.67 


19% 



68, 85 and 90%. Here, our definition of a cluster requires at least 
three members (e.g., a single high significance pair would not be 
considered a cluster), and a "clean" detection is defined as one 
where all recovered group members reside in the same true clus- 
ter (but do not necessarily comprise the whole true cluster). A brief 
description of the columns of Table[5]follows. N gives the number 
of clusters detected in total, the Detection Rate compares N to the 
number of true clusters in the experiment, which is 27. The Con- 
tamination Rate is a measure of the overall quality of detections and 
is the fraction of all detections with members from more than a sin- 
gle true cluster (e.g. not a clean detection). The Fragmentation Rate 
is the fraction of clean detections that are repeats, a high value of 
which represents an over-sensitivity to internal cluster scatter. For 
example, the 57% fragmentation rate in Pum = 90 comes from the 
fact that out of the 7 clean detections, 5 were made from stars of the 
same Hyades study, making 4 repeats. Ndetect is simply the mean 
number of stars in each group, while the mean efficiency measures 
the recovery rate of clean clusters, computed as the number of de- 
tected members divided by the total number of true members. 



6.2 Efficiency and constraints 

Table |5] illustrates some of the challenges of chemical tagging and 
gives an idea of what to expect in terms of efficiency and con- 
tamination. It should be noted, however, that the efficiency tabu- 
lated may not be particularly faithful to a true experiment. Many of 
the literat ure studies have low n um bers of stars, while th e Hyades 
sample of lOe Silva et alj | |2006|) and lPaulson et alj contains 
46 members. The stark contrast with little middle ground implies 
that our numbers likely represent upper bounds, since few clusters 
would be expected to have only several members, although it is un- 
clear how Galactic dynamical effects, such as disk mixing, spiral 
scatter, and radial migration would affect the number of original 
cluster members expected to be in the scope of a chemical tagging 
survey. The fragmentation rate is also dominated by the Hyades, 
primarily due to our requirement of at least three candidate mem- 
bers per detection, and may represent a lower bound. 

It is important to understand that the efficiency tabulated in Ta- 
ble |5] does not represent the total efficiency of recovering clusters, 
but rather the per detection efficiency of recovering members of a 
cluster. The total chemical tagging cluster detection efficiency is 
related to at least the detection rate, contamination rate, fragmen- 
tation rate, and mean efficiency combined. A simple calculation 
for the Piim = 90 case gives a total efficiency of ~2% (0.33*(1- 
0.22)*(1-0.57)*0.19*100%). This is an efficiency of placing indi- 
vidual stars in their birth clusters; the efficiency of detecting any 
fossil of an existing association would ignore the mean efficiency 
column and hence for Pum ~ 90 evaluates to ~12%. In a million 
star survey we might then expect to place 20,000 stars in associ- 
ations. Given the mean Ndetect of 4.67, this would amount to ap- 
proximately 4300 clusters being recovered in such a survey, to the 
90% confidence level. 

Of course, this is a preliminary number using the literature 



abundance database, as are all numbers listed in Table [5j a more 
realistic approximation would only be possible using the Pg^, cal- 
culated from a GALAH calibration program as described in Section 
14.11 It is also important to note that there is an implicit assumption 
in this calculation that all stars were indeed born in clusters si milar 
to those that we observe today. iBland-Hawthom et ^ ( 1201 Oah sug- 
gest, based on their chemical evolution models, that clusters in the 
low metallicity regime, corresponding to early generations of star 
formation, are more clumped in chemical space, with a wider range 
of abundance variation between clusters. This would certainly re- 
sult in a different probability function, leading to higher chemical 
tagging efficiencies, since the literature abundance database used 
here is restricted to higher metallicities (>~ — 0.35 dex), due 
to the metallicity distribution of Galactic open clusters. However, 
since most stars in the Galactic disk are also in the higher metal- 
licity range, this effect may not be important for most chemical 
tagging experiments. 

6.3 Tools for ctiemical tagging 

The methods used in these mock experiments are not a recommen- 
dation of how to perform group finding on large data-sets. The re- 
quirement of computing the pair-wise Sc metric over thousands, 
or millions, of stars may not be the most efficient way to identify 
structure. They do, however, suggest an algorithm for validating 
and verifying a group of potential coeval stars, G, identified within 
a larger sample of stars (e.g. a Galactic disk field sample) via any 
means: 

1. ) Compute Sc between all pairs of group G. 

2. ) Identify the 5c value, 5^'", corresponding to desired Pum 

from Pg^ . 

3. ) Identify all pairs where Sc < S^cT . 

4. ) All stars making up the pairs from 3.) are set Sc 

5. ) All stars making up the pairs not included in 3.) are set 5„c 

6. ) G is re-evaluated to be the complement of Snc in 5c. 

7. ) Finally, the cluster detection confidence, Pdua, is evaluated 
using the mean of Sc in G. 

Aside from ensuring that all pairs in the resultant group have 
a high probability of being natal siblings, following the preceding 
prescription yields a qualitative assessment of our confidence in 
the cluster detection based on a global knowledge of the chemical 
nature of Galactic open clusters. 

Figure [8] is a graphical representation of the above pipeline, 
for clarity. The overlap in the sets Snc and Sc illustrates the stars 
that comprise both high and low probability pairs. Only the stars 
inside the shaded region of Sc are then included as members in the 
final cluster detection. The final number of Sc pairs shown at the 
bottom of the pipeline will be less than the number of confident 
pairs (those having P > Pum) shown above, if there is overlap 
between Snc and Sc as depicted in this diagram. 

This pipeline can be run efficiently, without the need for 
computing the Sc metric over all pairs, with the density based 
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Figure 8. Flow diagram describing the steps outlined in section l63l for val- 
idation and verification of cluster finding via chemical tagging. The over- 
lap shown between Snc and Sc represents the stars that are in both confi- 
dent and non-confident 5c pairs. Only those in confident and not in non- 
confident (i.e. the green region of the Sc circle) comprise the final cluster 
members. 



group finding algorithm, EnLink, described in lSliarma & JolinstonI 
( l2009l) . The final cluster stars derived from the set Sc. shown in Fig- 
ure[8l would result from a Sc cut in the group detections output by 
EnLink. A further investigation of this application, and the overall 
use of Psq, is forthcoming based on a fully blind chemical tag- 
ging experiment on a homogeneous abundance dataset of several 
hundred thin and thick disk field stars. 



7 SUMMARY & CONCLUSIONS 

We have introduced a metric, the 5c metric, designed to quantify 
the chemical difference of a pair of stars in a single context, re- 
gardless of the number or nature of the measured element abun- 
dances, between distinct pairs. Using a database constructed from 
high-resolution literature abundance studies of Galactic open clus- 
ters, we have analyzed the relative contributions of 5c pairs for 
both intra- and inter-cluster pairs, to arrive at a probability func- 
tion, Ps^. , describing the likelihood that two stars share the same 
evolutionary origin. 

For the specific case of a minimum of 8 chemical dimensions, 
we have derived parameters to a functional form of Psq, whose 
68% confidence level (analogous to a l-tr detection) lies at a 5c 
of ~0.058 dex. This level, in addition to the steep slope present in 
Ps^ , informs the accuracy necessary to distinguish cluster mem- 
bers from polluting stars. We have probed the effect of variations in 
the dimensionality of chemical space and the role of individual el- 
emental species on the probability function. It is clear that limiting 
the number of dimensions present severely reduces the inter-cluster 
variations, and thus the approximately nine species prevalent in our 
analysis are a minimum recommendation. Unfortunately, owing to 
the difficulty of measurement, we could not explore higher dimen- 
sions. Of the key players, the a elements (by proxy), Fe and Ba 
particularly stand out, contributing up to 10% of the Ps^. each. 

We have placed these results in the context of chemical tag- 



ging, and shown how they can be used to validate and verify co- 
eval groups discovered via any means. An important outcome is the 
ability to determine and report a confidence of detection, based on 
our observational understanding of Galactic structure as opposed 
to theoretical models or arbitrary statistical tests. We have utilized 
this ability to highlight some of the challenges involved in a true 
chemical tagging experiment. It is evident that the reconstruction of 
coeval groups of stars, dispersed open clusters, via the technique of 
chemical tagging is a difficult, but plausible, venture. Even though 
it is impossible to reconstruct all such groups, and seems unlikely 
even to discover most, the numbers in Table[5]point to valuable in- 
formation that can be inferred about what cannot be detected, from 
what can. They also speak to the advantages of a very large sample 
of data, given the low efficiencies even at low confidence levels. 

The requirements for internal abundance accuracy of a chem- 
ical tagging survey such as GALAH can be gleaned from the intra- 
cluster probability function. However, this function has been de- 
rived from the currently available set of abundance data in the lit- 
erature which matched our requirements for resolution, elements, 
and number of stars, analysed in several different ways between lit- 
erature studies. With such a heterogeneous sample, considerations 
of systematics effects must not be ignored, and may result in signif- 
icant performance loss as discussed in Sections[3]and|4] The func- 
tion derived in this analysis is only the best guess to date and its 
use requires understanding the caveats discussed in this text. The 
results, however, show that the methods introduced here provide 
a powerful tool for conducting and understanding chemical tag- 
ging in a practical sense. To achieve the best possible confidence 
in cluster reconstruction, a chemical tagging experiment of large 
scale should calibrate against a number of known clusters in order 
to derive P^^ for the specific analysis methods used therein. 

Finally, a pipeline recipe for validation and verification of co- 
eval groups of stars detected using element abundances only, has 
been developed. A logical next step is to use this recipe in a small 
scale chemical tagging experiment to further develop these meth- 
ods in a real world scenario, and to demonstrate how the tool En- 
Link can be applied to this problem. Such an experiment is cur- 
rently in progress by the authors on a large homogeneous sample 
(several hundreds) of stars with a wide range of high-resolution 
abundance measurements. 
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